Analyzing Large Data Sets: rbcL 500 Revisited
Similar resources
Analyzing large data sets: rbcL 500 revisited.
In 1993, Mark Chase and 41 coauthors published phylogenetic analyses of two very large data sets of nucleotide sequences of the chloroplast gene rbcL, which encodes the large subunit of ribulose 1,5-bisphosphate carboxylase. Their paper was important for several reasons. These analyses were (and still are) among the largest ever attempted using parsimony. The assembly of such a large number of ...
Collaboratively Analyzing Large Data Sets using Multitouch Surfaces
Copyright is held by the author/owner(s). CSCW’12, February 11–15, 2012, Seattle, Washington, USA. ACM 978-1-4503-0556-3/12/02. Abstract: The analysis of large data sets is increasingly collaborative, multidisciplinary, and even distributed. There are many advantages, including numerous checks and balances on the results. But, even in highly distributed analytic tasks, it would be useful to concur...
Sets, Bags, and Rock and Roll: Analyzing Large Data Sets of Network Data
As network traffic increases, the problems associated with monitoring and analyzing the traffic on high-speed networks become increasingly difficult. In this paper, we introduce a new conceptual framework, based on sets of IP addresses, for coming to grips with this problem. The analytical techniques are described and illustrated with examples drawn from a dataset collected from a large operation...
PixelMaps: A New Visual Data Mining Approach for Analyzing Large Spatial Data Sets
PixelMaps are a new pixel-oriented visual data mining technique for large spatial datasets. They combine kernel-density-based clustering with pixel-oriented displays to emphasize clusters while avoiding overlap in locally dense point sets on maps. Because a full evaluation of density functions is prohibitively expensive, we also propose an efficient approximation, Fast-PixelMap, based on a synth...
Visualising large data sets
Large data sets are different and new methods of display are needed for dealing with them. This paper reviews the standard problems in displaying large numbers of cases and variables, both continuous and categorical, and emphasises the need for improving current software. Much could be achieved by adding interactive tools to standard displays to provide greater flexibility and to facilitate a m...
Journal
Journal title: Systematic Biology
Year: 1997
ISSN: 1076-836X,1063-5157
DOI: 10.1093/sysbio/46.3.554